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ABSTRACT 



A set of program elements (e.g., transcoders) are grouped 
together as an administrative unit. Instead of caching the 
individual outputs of each program element, preferably only 
the aggregate output of the set of program elements, taken 
as a whole, is cached. The inventive technique enables the 
effective re-use of intermediate content. In an illustrative 
client-server based implementation involving a transcoding 
service located at a server, the cached information may be 
shared across multiple server instances to obviate redundant 
processing. With the present invention, a caching mecha- 
nism in a complex software system may be extended in a 
user-configurable manner by setting up optimal intermediate 
caching points that are defined by groups of programs used 
in long computations. 

24 Claims, 5 Drawing Sheets 
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METHOD FOR ESTABLISHING OPTIMAL (RE) that uses the rules provided to map a request to a set 

INTERMEDIATE CACHING POINTS BY of plug-ins that then perform the actual content transcoding. 

GROUPING PROGRAM ELEMENTS IN A In generating a response to a given request, a number of 

SOFTWARE SYSTEM intermediate steps (plug-ins) also may be executed. 

5 It is desirable to re-use the output of these intermediate 
steps, because many requests (from different clients) will 

BACKGROUND OF THE INVENTION likely contain steps in common and, because the overhead of 

1 Technical Field regenerating like output can be avoided by providing a 

' . „ caching mechanism. Likewise, a given client may make a 

The present invention relates generally to data manage- request for the same resource lhal was requested m the 

ment in a complex computer system and, in particular, to a recent past For example, assume a user logs in to his 

method of reusing software elements in such a system to accoUQt t0 check the staUls of an online ofder If the order 

facilitate efficient intermediate data caching. status has not cnangedj it should not be neC essary to regen- 

2. Description of the Related Art erate the status page to be returned. The page should have 

Intermediaries are computational entities that can be 15 been cached, with the lifetime of the page being controlled 

positioned anywhere along a data stream, e.g., an HTTP by cache expiration policies. 

stream, and are programmed to tailor, customize, One of the main problems in caching of intermediate data, 
personalize, or otherwise distill or enhance data as it flows however, is the appropriate granularity of caching. If the 
along the st ream. A caching web proxy is a simple example number of intermediate points in execution at which the data 
of an HTTP intermediary. Intermediary-based programming 2Q is cached is not properly balanced with the time taken to 
is particularly useful for adding functionality to a system produce and save the intermediate data, then intermediate 
when the data producer (e.g., a server or database) or the caching is counterproductive to system performance, 
data consumer (e.g., a browser) cannot be modified. The present invention addresses the problem of how to 
A known intermediary architecture and framework, called effectively perform intermediate caching in a complex soft- 
Web Intermediaries (or WBI, pronounced "webby"), is 2 s ware system such asMhe type described above, 
usefiil for creating intermediary applications on the wetx BRIEF SUMMARY OF THE INVENTION 
Basically, WBI is a programmable web proxy and web 

server that, together with a web development kit, may be According to the present invention, a set of program 
used with Java APIs for building web intermediary appli- elements (e.g., transcoders) are grouped together and, for 
cations within the WBI framework. Examples of interme- 30 caching purposes, are administered as a unit. Instead of 
diary applications include: transcoding content between caching the individual outputs of each program element, 
formats, personalization, password and privacy preferably only the aggregate output of the set of program 
management, interactivity, and content filtering. WBI is just elements, taken as a whole, is cached. The inventive tech- 
one example of an architecture for managing web interme- nique enables the effective re-use of the program elements 
diaries. Other conceptual techniques include, for example, 35 and tne intermediate content produced thereby. In an illus- 
chaining of Java-based servlets. trative client -server based implementation, e.g., a transcod- 
Transcoding is the process by which a data object in one m S P rox y located at a server, the cached information may be 
representation is converted into another representation. shared across multiple server instances to obviate redundant 
Typical examples include conversion within media types processing of client requests. 

(e.g., an image encoded in one standard is transcoded into an 40 i{ ^ a s P ecific ob ject of the present invention to 

image encoded in a second standard), as well as conversion group program elements together in a complex software 

between media types (e.g., speech to text). A transcoder is a system by caching an aggregate output so that computations 

functional module, typically in the form of a piece of are not(repeated when unnecessary and excessive caching is 

computer code, that accepts a data stream as input and avoided. It is a particular object of the invention to provide 

produces a data stream as output. The output stream is 45 such a cacliing mechanism in the domain of transcoding data 

created based on the transcoder's input stream and the fiom the Web. 

function that the transcoder is designed to perform. A set of It is another primary object of the present invention to 
"n" transcoders may be chained or otherwise grouped provide a mechanism for grouping a set of related transcod- 
together to form a transcoder proxy framework. In a given ers together for caching purposes. As an illustrative 
transcoder, the transcoding function could be one of any 50 example^group of related transcoders are a set of software 
number of operations such as: a request modifier, a content routines that transcode images from a given image format 
editor, a content generator, or the like. A given transcoder (e.g., gif) to one or more other image formats (e.g., .png, 
output stream may also be based upon an original or jpeg, or the like). Another example of a group of related 
modified request, histographical information on transcoding transcoders includes transcoders that convert data from one 
paths already taken, and/or external preferences (e.g., net- 55 format to another data format. Thus, for purposes of inter- 
work connection speed, client device capabilities, user mediate caching according to the present invention, the 
preferences, and the like) established in a separate database. individual transcoders in a group may have a given rela- 

The WBI backbone provides a well-defined interface, uonship to one another, 

known as a sublayer, for receivin g requests and generatin g In a particular embodiment, a set of transcoders that are 

^responses. In a server supporting the WBI framework, there 60 operative in a transcoding framework and that share a set of 

may exist multiple sublayers, each accepting requests from preference categories (e.g., given network, user or device 

a different source (e.g., a proxy listening on a port, a serial preferences) are grouped together to decrease the frequency 

port, a servlet, and the like) and handling a specific protocol of intermediate caching in the framework. Thus, instead of 

(e.g., HTTP, FTP, POP3, and the like). A sublayer is respon- caching the output of every transcoder, only the group output 

sible for instructing the WBI backbone on how to process a 65 is cached. 

request by providing a set of rules that describe how to According to another feature of the present invention, a 

generate a response. The backbone provides a rules engine group of related transcoders preferably is defined by a 
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system administrator. By enabling administrator control standalone proxy server, as a managed Java servlet under a 

over the granularity of intermediate caching, overall system web server application (e.g., IBM WebSphere) that registers 

performance is spread over many client requests and is thus for all requests, as a callable service (via a direct API), or as 

maximized. an Enterprise Java Bean (EJB) callable from any server-side 

With the present invention, a caching mechanism in a 5 application. While the following description is made in the 

complex software system may be extended in a user- context of a WBI-based transcoding backbone, this is not a 

configurable manner by setting up optimal intermediate limitation of the present invention. The concepts described 

caching points that are defined by groups of programs used below are separable and may be implemented in other 

in long computations, thereby balancing space and time servlet-based environment or otherwise coded directly into 

resources in the system. 10 a server program or as a plug -in to such a program. 

The foregoing has outlined some of the more pertinent Referring back to FIG. 1, a given transcoder 102 is 

objects and features of the present invention. These objects registered with the backbone by an administrator either 

should be construed to be merely illustrative of some of the statically (e.g., in a properties file 104) or dynamically (via 

more prominent features and applications of the invention. a console 106 at the server). Alternatively, the transcoder 

Many other beneficial results can be obtained by applying 15 ma Y be registered via a remote Java-based GUI. Once 

the disclosed invention in a different manner or modifying registered, one or more transcoders are loaded when the J 

the invention as will be described. Accordingly, other backbone is initialized (e.g., started from a command line, 

objects and a fuller understanding of the invention may be initializing a servlet wrapper, or the like, depending on the 

had by referring to the following Detailed Description of the backbone configuration). After the individual transcodersO 

Preferred Embodiment. 20 are loaded, the backbone 100 may query the transcoders via \ 

interfaces 114. 

BRIEF DESCRIPTION OF THE DRAWINGS Intermediate caching of data in the WBI framework is 

For a more complete understanding of the present inven- desirable if such caching can be appropriately balanced with 

tion and the advantages thereof, reference should be made to 25 * e time ta f? to f. roduc ^ the intermediate data, 

the following Detailed Description taken in connection with ™ e P/^Qt invention solves thus problem by associating ^ f , 

the accompanying drawings in which: f & t ofMEGs into one or more MEG (or transcoder) groups. ■ > 

.„ . . Instead of cachmg the outputs of each MEG in a given 

FIG. 1 is a block diagram illustrating a transcoding group? only the final output is cached FIG 2 iUustrates a } 

backbone for use in registering, loading, initializing and pre ferred mechanism for use in providing this group func- {vu^JlA 

managing a set of transcoders; , 30 Uonality according to the invention. V f 

FIG. 2 is a simplified block diagram of a group admin- In particular, a group administration manager 200 is 

istration mechanism of the present invention; responsible for grouping together of MEGs into MEG 

FIG. 3 is a representative graphical user interface dialog groups. A set of MEG groups may be executed according to 

for use in enabling a system administrator to assign given 3S a set of execution priorities stored in the database 202. 

transcoders to one or more groups according to the present Further, a given MEG also may have associated therewith a 

invention; number of dependencies, e.g., what network rjrefereBCfcS-the 

FIG. 4a is a block diagram of a WBI client session MEGJs^depende^oj^^to user preferences JheMQE^Lis 

sublayer in which the present invention may be imple- ^de^gnaent^nrwha 

mented; 40 dent on, and^so-forth^ Representative network preferences^ 

FIG. 4b is a block diagram of a WBI transaction session ^or-exam^rCinclude supported and unsupported extensions, 

sublayer for use in conjunction with the client session images, compression source, and the like. Representative 

sublayer of FIG. 4a; user preferences include a text vs. image link preference, a 

T7i/- c • i f i 4 tv P e °f desired source compression, how images are 

FIG. 5 is a block diagram of a complex software system /. , - * . * - ' . 1<1 6 n 

illustrating a variety of programs that work in sequence and 45 ^P oseA ' P' eferred "T^ scahn S f f tors > or th / ^ Re P- 

in parallel resentative device preferences include supported and unsup- 

„_ , !„ t „ ported formats, device types, desired content types, and the 

FIG. 6 illustrates how a group of program elements are like . dependencies are stored, preferably as (key,^> 

aggregated together for reuse according to the present inven- vahie) airs> in a ^ah^ 204. 

tion; and . , . t ~777^"~~~- 1 , ^ 

m .„ . , * , . . <* . 50 Aplug-in writer, if he or she chooses to take advantage of 

FIG. 7 illustrates another example of the principles of the the ioventive caching techniques, may provide the prefer- 

present invention in the context of a web transcoding ence information when the me G fe authore d. The adminis- 

apphcation. trator who msta Us me MEG then uses an interface 206 of the 

group administration manager 200 to assign the MEG to a 
55 given group, to identify whether or not the output of a 
particular group should be cached, to define the execution 

By way of brief background, and with reference to FIG. order °f tnc MEGs within a given group, to define the y 

1, the present invention may be implemented in a transcod- execution order of the MEG groups, L and so_forth.-The~ ^ 

ing backbone 100 into which individual transcoders interface 206 may also be used to create new groups, to alter 

102a-102/i are installed and managed. Aknown transcoding 60 or modify the preferences of a given MEG, and the like. The 

backbone is WeB Intermediaries ("WBI" or webby). Agiven interface 206 may be a graphical user interface (GUI), a 

transcoder in WBI is sometimes referred to as a MEG, which command line interface, or any other convenient implemen- 

identifies the particular function (e.g., request Modifier, tation. A representative GUI implementation is described in 

content Editor, content Generator) that the transcoder per- more detai l below. 

forms. Typically, a transcoder comprises a set of software 65 In the preferred embodiment of the invention, MEGs that 

routines. The transcoding backbone 100 may be imple- have related functionality are grouped together in a MEG 

mented in any number of convenient ways, such as a group. In this way, program elements within the software 
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system are made easily reusable. Thus, for example, a given FIG. 3 illustrates a representative GUI interface that may 
set of related MEGs may be those plug-ins that change one be used to implement the group administration functionality 
image format to another, e.g., .gif to .png or .jpeg, those of the present invention. The illustrative display panel is a 
plug-ins that change one data type to another, e.g., AFP group administration panel 300 that includes a Select Group 
(Advanced Function Printing Language) to SVG (Scalable 5 Order display field 302, an Assign Transcoders display field 
Vector Graphics, a W3C standard XML 2D graphics 304, and an Available Transcoders field 306. Using a con- 
language), Postscript to HTML, or other such convenient ventional drag-and-drop mechanism (or any other conve- 
relationships. Related MEGs may be those that convert data nient user I/O technique), the administrator assigns available 
types without regard to device, user or network preferences, transcoders to groups and orders the groups for execution, 
or those that convert one markup language to another. A 10 As noted above, the assignment of transcoders into group(s) 
group may comprise MEGs that share preference categories, may be done dynamically as well. In the illustrative 
given preferences, or the like. Still another type of MEG example, Group C is positioned for execution before Groups 
group may be various MEGs involved in creating a docu- A and B. Group C includes transcoders that convert without 
ment object model (DOM) representation of given content regard to device/user/network preference from AFP to either 
within the W3C DOM guidelines. Of course, the present 15 HTML or SVG. Group A has only a single transcoder that 
invention is not limited to any particular type of relationship converts from SVG to HTML. Group B takes into account 
with respect to a given MEG group. Referring back to FIG. device/network/user preferences and reduces HTML to 
2, to support the MEG group functionality, a MEG typically WML, and HTML to VoxML (Voice Markup Language, 
provides given information to the group administration another XML standard). The available transcoders (marked 
manager via the following methods: 20 as Ungrouped) include an AFP to Postscript transcoder, an 

(a) String getgroup( )— get logical group name MEG HTML to PvC HTML (for pervasive computing devices that 
belongs in; cannot handle full HTML functionality), and a Xerox to AFP 

(b) String[ ] getDependencies( )-gct logical names of transcoder. Of course, all of the above transcoder functions, 
categories this MEG is dependent upon; and as wel1 35 meir representative groupings are merely illus- 

(c) boolean isNeverCached( ^possible placeholder to 25 ^ive. The group atoimstrata 

optimize owning group's ability to cache. n0t / imited ^TtfZ transcodin g ^ction(s) 

TTie following methods are then provided to the Execution 01 ' * ™ y P^r MEG groups ^group events 

p rf ^™ r • , r __j' The user dialog also includes a Create New Group button 

Processor in the transcoding service: - AO 0 , r . . , _ j. , / . \ 

, v „ . „ , x 308. Selection of this button opens up a dialog (not shown) 

(a) Enumeration enumerate Groups( ); 30 that includes a list of the available transcoders. From this 

(b) String [ ] getDependenciesForGroup( ) — creates union fist, the administrator can create a custom group and then 
of all dependencies MEGs in group have returned; savc the results. By selecting a given transcoder (e.g., from 

(c) ExecuteMEGLoop (Group, RequestObject) the group dialog 300 of FIG. 3 or from the Create New 
Whenever Enumeration appears, it may be an object that Group dialog) the administrative interface opens up a 
wrappers or implements its own Enumeration. In addition, 35 transcoder dialog (not shown) from which the administrator 
when the group functionality is used, the administrator may define given characteristics of the transcoder, modify 
should ensure that the first group that is selected produces a network, user or device dependencies, and the like, 
response (i.e., generates an output). This is a default require- According to the present invention, intermediate caching 
ment. of group output data is used to facilitate reuse of MEGs in 

Further, to implement this functionality, a registerAdmin 40 a server that includes a transcoder service. An illustrative 

(Admin) method in the transcoding service is added to the WBI framework that utilizes the present invention is now 

Proxy class to receive notification of administrative infor- described. 

mation. The caching and session management implementa- FIGS. 4a-Ab illustrate a transcoding service that is 
tion of Run.java (described below) creates Proxy loading operative, for example, at a web server. This service pro- 
plug-ins (but not sublayers) from property files. Then, a 45 vides client session management and caching within the 
sublayer is created (which also administers Admin), and this context of a WBI-based backbone. To this end, two special- 
sublayer is registered as an administrator to the WBI back- ized sublayers, referred to as a client session sublayer and 
bone and as a sublayer to the Proxy. The sublayer receives the transcoding session sublayer, respectively, are utilized, 
notification of MEG registrations and passes the information Generalizing, a sublayer Ksjejis^Qn ^one or moje-pQjtsJgr 
to an Execution Processor to make appropriate groupings 50 inc oming^ requests from a clie nt (such as a browser) and 
using the methods set forth above. Upon being queried, the propagates the request through the backbone and registered 
group administration manager returns an enumeration plug-ins that produce a response to be returned to the client, 
object, namely, enumerateReverseOrderedGroups( ), As will be seen, a given sublayer utilizes a caching proxy to 
wherein the last group to execute is the first in the list. cache both intermediate transcoding steps as well as final 
Thus, according to one embodiment of the present 55 output so that it is not necessary to regenerate output each 
invention, group priorities are preferably administrator- time a request for a given resource is received. The group 
defined and MEGs are called during initialization at their caching techniques of the present invention are used for this 
getGroup( ) interface method to get the logical group name purpose. 

for the group to which they belong. By using the The client session sublayer 400, which is illustrated in 
hasMoreElements( ) method and the nextElement( ) method, 60 FIG. 4a, is responsible for identifying client sessions. Client 
the groups can be obtained. sessions may be identified, for example, using cookies or 
Alternatively, one of ordinary skill in the art will appre- some other implementation. The transcoding session sub- 
ciate that the MEGs may be called into their groups in a layer 450, which is illustrated in FIG. 4b, is responsible for 
dynamic fashion by the group administration mechanism. In caching both intermediate transcoding steps and final output, 
particular, a given MEG group may be defined and generated 65 Preferably, an existing caching proxy 402 is used for this 
"on-the-fly" (i.e., dynamically) as the transcoding service functionality, as there are many such proxies readily avail- 
adapts to different operating conditions. able. As is well known, a caching proxy works via a pull 
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mechanism. In particular, a resource is requested from the 
cache and, if available, it is returned; otherwise, the cache 
fetches the resource itself and then returns it to the client. 
This behavior requires that all requests for a resource be 
made through the cache. 5 

In a preferred WBI environment such as illustrated in 
FIGS. 4a-4b, a URL rewriting scheme is employed in 
response to a given client request. According to the 
technique, when an original request URL is received at the 
client session sublayer 400, this URL is modified such that 10 
it points to the transcoding session sublayer 450. The 
rewritten URL encodes the original URL as well as all of the 
transcoders (i.e., MEGs) that must be run to satisfy the 
request. The client session sublayer 400 then requests this 
rewritten URL through the cache 402 (a proxified request), 15 
and upon a cache miss (e.g., the first time the resource is 
requested), the transcoding session sublayer 450 invokes the 
proper MEGs to generate the response. Because the request 
was made through the cache 402, the response generated by 
the transcoding session sublayer 450 gets cached and is 20 
returned directly by the cache for all subsequent requests for 
that same resource (subject to cache expiration policies). 

The client session sublayer 400 and transcoding session 
sublayer 450 work in concert to perform content transcoding 
via the WBI backbone and to cache the output of the 25 
transcoding for re-use on subsequent requests. Preferably, 
both the client session sublayer and the transcoding session 
sublayer use the HTTP protocol. 

Referring now to FIG. 4, the Client Session Sublayer 400 
includes a Protocol Interpreter 404 at its input and a Protocol 30 
Encoder 406 at its output. The Protocol Interpreter 404 
listens on a dedicated port for HTTP requests from clients 
and works with a Session Manager 408 to create and/or 
identify existing client sessions. Annotation (a) in the figure 
indicates that the Protocol Interpreter provides a client key 35 
if it is provided as part of the protocol. A default session is 
used if no session key is provided. The client session 
sublayer also includes a pair of rule engines (REs) 409 and 
410. The sublayer propagates requests through the WBI 
backbone through a registered Requestlnterpreter. 40 

When a request is received by the sublayer, it creates 
RequestHandle and ResponseHandle objects. The Request- 
Handle encapsulates the InputStream from the socket 405 
that was created when the sublayer accepted the connection 
(serverSocketaccept( )) and the Protoco Interpreter. The 45 
ResponseHandle encapsulates the OutputStream of the 
socket and the ProtocolEncoder. The backbone uses the 
Protocol Interpreter 402 to 'interpret* the request Input- 
Stream into a form usable by the registered MEGs. If the 
Protocol Interpreter 402 finds a cookie or other session 50 
identifer in the InputStream, it asks the Session Manager 408 
to return a Session Object that represents that client session. 
Annotation (b) in the figure indicates that the client key may 
be derived from a cookie implementation. Annotation (f) 
indicates that the Session Object is added to the request 55 
specific data maintained by the RequestHandle object. Once 
an existing session, if any, is identified, the Client Session 
Sublayer queries the Group Administration Manager 412 
and its associated database 414 of group execution priorities. 

As described above, group priorities preferably are 60 
administrator-defined. As reflected by annotation (e), Given 
MEGs are called during initialization at their GetGroup( ) 
interface method to gel a logical group name to which they 
belong. The Group Administration Manager 412 returns the 
MEG groups, preferably in reverse order; i.e., the first group 65 
returned preferably is the last group to be executed. For each 
group returned, the Client Session Sublayer queries a Pref- 



,876 Bl 

8 

erence System 416 and its associated database 419 for a list 
of dependencies (preference values). 

These dependencies which, as described above, are pref- 
erably (key, value) pairs, are then compressed into a single 
string by the Preference System 416. Annotation (c) in the 
figure indicates that preference keys are usually homog- 
enous across all server instances. These dependencies pref- 
erably are the union of all dependencies of the individual 
MEGs within the group to which the MEGs belong. In an 
representative embodiment, these dependencies are 
retrieved from the Preference System via the 
getDependencies( ) interface method. Annotation (i) in the 
figure indicates an optional test of the client Session Object 
at the point indicated. 

Once all of the groups and their dependencies are 
obtained, a URL rewrite is performed by Generator 418. The 
new URL generated includes the original request, the MEG 
groups and their dependencies, and a cookie (or other 
session identifier) if one was obtained from the Session 
Manager 408. This new URL is then requested from the 
transcoding session sublayer 450 (in FIG. 4b) via a cache 
interface 420. Annotation (d) indicates that Generator 418 
preferably uses an abstract class, such as ByteStore, to allow 
for sliding window buffering and other features. 

As an illustrated example, the new URL may be of the 
following form: 

GET http://TSS:8000/www.cnn.com/groupl/ 
compressed datal/ group2/compressed_data2. 
If the resource represented by this URL is currently in the 
cache 402, the resource is returned directly from the cache 
and passed to the ProtocolEncoder 406 to be encoded and 
returned to the client via the OutputStream contained in the 
ResponseHandle. If the resource is currently not cached, 
however, the cache 402 forwards the request to the 
Transcoding Session Sublayer 450. As will be seen, the 
Sublayer 450 then propagates the request through the back- 
bone to generate the proper response. 

When a request is finally complete and returned from the 
Transcoding Session Sublayer, the response may have a 
cookie that must be stored. In such case, the Client Session 
Sublayer 400 passes the cookie to the Session Manager 408 
so that this client may be joined into a session. Subsequent 
requests by the same client would then be recognized by the 
Session Manager 408 as described above. 

In FIG. 4, the Client Session Sublayer includes another 
editor transcoder 422. As indicated by annotation (g), the 
transcoder may be used to implement and administer an 
optional client caching policy that is different from an origin 
server caching policy. The client caching policy may be set 
up by an administrator. This allows for retranscoding on a 
given time granularity that is controlled at the transcoding 
proxy level. Referring now to FIG. 4b, the Transcoding 
Session Sublayer 450 listens on a dedicated port for requests 
from the Client Session Sublayer 400. As noted above, the 
function of the Transcoding Session Sublayer 450 is to 
actually propagate the request through the WBI backbone to 
achieve the desired transcoding. The Transcoding Session 
Sublayer shares many of the components of the Client 
Session Sublayer, as can be seen. 

Given the input request of the form set forth above, the 
Transcoding Session Sublayer 450 determines that the URL 
contains more than one group name. Therefore, the Sublayer 
first builds a new URL, preferably by removing the last 
group name and data. The resulting URL is: GET http:// 
TSS:8000/www.cnn.com/groupl/compressed_datal. The 
Sublayer then requests this new URL through the cache 
interface. As long as there is a cache miss, the Transcoding 
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Session Sublayer recursively requests URLs in this manner Program A 604 generates input data AB 605 and input data 

through the cache 402. In the case where this is the first AD* 607 for program B 606 and program D 608, respec- 

request for this particular set of MEG groups, no cache hits tively. Program B 606 generates data BC 609 for program C 

will occur. Thus, as soon as the Sublayer notices that the 610. Program C 610 and program D 610 together contribute 

URL contains only a single MEG group/data pair, it sends 5 to tne production of data CD *F 612 as input to program E 

the request into the backbone to generate the content. 614 Program E 614, in turn, generates data EF* 616 for 

Because the request was originally made through the cache program F 618. Program F produces final output G* 620. 

interface 402, however, the OutputStream finally returned As can be seen by comparing FIGS. 5-6, programs B and 

by this last MEG group gets cached. It is available as input C u are executed twee even though their input and output are 

for the prior MEG group. As the chain unwinds, each 10 th K c sa * e across f both operations As has been described 

i • . - m • . r * t . t . c above, the present invention obviates such redundant execu- 

intermediate transcoding step (i.e the output of a given ^ Q ' f ^ b grouping the elements 

MEG group) gets cached and thus is available should hef (ag b ^ ^ ^ £ nQ 6) and 

subsequent requests require that same output. As indicated cacM on[ the ^ t of the efltire instead ^ (he 

by annotation (h), this step is a recursive request to the outpilt of program B md the output of program c 

Transaction Sublayer that still pulls through the cache and 15 FIG 7 in usl rates a more concrete example of the present 

either comes to prev-null and unwinds or gets an intenne- invention. In this example, the input 701 is an HTML 

diate result from cache and unwinds from that point. As document with a gif image. In response to a first client 

indicated by annotation (j), if generators retrieve origin data request for this input, the output 703 returned to the first user 

via HTTP, they should do so through the cache interface so is a PDF document with a jpeg image in place of the .gif 

that original (i.e., untranscoded) resources are cached as 20 image. In response to a second client request, the output 705 

well. In FIG, 46, the diamond "1" represented by reference returned to the second user is a SVG document with the .jpeg 

numeral 460 is a decision point that determines the subse- image. In this example, the group 702 illustrates the ele- 

quent path through the system. When the final output has ments that are aggregated together for caching purposes, 

been generated, the ProtocolEncoder can be invoked to preferably using the group administration functionality pre- 

return a result to the requester. As described above, the 25 viously described. The group 702 comprises a first program 

requester in this case was actually the Client Session Sub- element 704 that parses the .gif image, a second program 

layer 400. The Client Session Sublayer then returns the element 706 that converts the .gif image to a .png format, 

resource to the real client, namely, the requesting client and a third program element 708 that converts the .png 

browser. This completes the processing. image to a .jpeg format. Only the output 707 of the group 

Many variants are within the scope of the present inven- 30 (namely, the third program element 708) is cached and then 

tion. Thus, for example, when a MEG within a group throws reused to facilitate responses to the individual client 

an exception to fail, a request execution controller may skip requests. 

other MEGs in the same group. Also, the group administra- As has been previously described, to support group 

tion functionality may be augmented to preconfigure group administration, the MEGs used in the transcoder framework 

rules so that entire MEG groups can be skipped under given 35 should have associated with them the following: a logical 

circumstances. group name to which a MEG belongs, logical names of 

Thus, according to the present invention, software ele- categories a MEG is dependent on, and (optionally) an 

ments are grouped together with the output of the group indication that a MEG is never cached (either because it 

cached for reuse. The grouping together of software ele- contains personalized data or has some other security 

ments is desirable to reduce the amount of cache space used 40 implication). Preferably, MEG works in conjunction with 

while, at the same time, ensuring that the cache is not the group administration mechanism. The MEG author 

overburdened. The output of a given group is kept in cache should provide adequate documentation of the MEGs when 

and reused in future computations. A further illustration of he or she supplies plug-ins so that intelligent decisions can 

the inventive concept is now described with respect to FIGS. be made by the group administrator on classifying the 

5-7. 45 MEGs. 

FIG. 5 illustrates a conventional complex software system The present invention provides numerous advantages 
500. For purposes of illustration only, input data A 502 is the over the prior art. With the present invention, a caching 
input to program A 504. Output G 520 is the final output of mechanism in a complex software system may be extended 
the system. Program A 504 generates input data AB 505 and in a user-configurable manner by setting up optimal inter- 
input data AD 507 for program B 506 and program D 508, 50 mediate caching points that are defined by groups of pro- 
spectively. Program B 506 generates data BC 509 for grams used in long computations, thereby balancing space 
program C 510. Program C 510 and program D 508 together and time resources in the system. Preferably, MEGs are 
contribute to the production of data CDF 512 as input to grouped according to whether or not the individual MEGs 
program E 514. Program E 514, in turn, generates data EF share a given relationship to one another. The particular 
516 for program F 518. Program F produces final output G 55 relationship may be based on image formats, data formats, 
520. network, device or user preferences, or the like. 

In many web-based scenarios, the Input Data 502 com- Intermediate caching of image transcoding according to 

prises two elements, a fixed element and a variable element. the present invention increases system performance consid- 

For example, the Input Data A may comprise web page data erably. Suggested grouping of MEGs may include image 

together with user preferences on how the web page data is 60 groups whenever appropriate, as it is from the caching of 

to be transcoded for display (e.g., without images). images that significant performance advantages may be 

Typically, the web page data is a fixed element and the user achieved during the operation of the transcoding framework, 

preferences are variable elements. FIG. 6 illustrates the Typically, image data form a substantial portion of Web 

computer system of FIG. 5 where another set of variable traffic (well over 50% and maybe as much as 70%) if size is 
elements is used as initial input (600). For purposes of 65 the determining criterion. Thus, caching image groups and 

illustration only, input data A* 602 is the input to program reusing them in transcoding according to the invention 

A 604. Output G* 620 is the final output of the system. derives significant performance advantages. 
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The present invention may be used with third party "pull" 
caching solutions (e.g., Squid, WTE, and the like) so that 
users may take advantage of existing caching software. 
When used in the WBI framework, existing transcoders do 
not have to be modified to cause the final output of a group 5 
of such transcoders to occur. In effect, these caching points 
are added without overhead or expense by simply installing/ 
configuring the new pluggable components on top of the 
"core" WBI codebase. Thus, transcoder developers that 
desire to use the caching design need not provide additional 
interfaces within their code to cause these caching points to 
occur. Instead, simple configuration via the administration 
panels and additional plug- in installation interfaces on top of 
well-known WBI plug-ins provide enough information to 
specify these caching points. 

A given transcoder comprises software, i.e. a set of 15 
program instructions, executable in a processor. A represen- 
tative processor is x86-, Pentium-, PowerPC®- or RISC- 
based, and that includes an associated operating system. A 
representative computing platform is an IBM S390 and 
AS400. 20 

The present invention may be implemented in a conven- 
tional client -server computing environment. A given client 
machine and the server may communicate over the public 
Internet, an intranet, or any other computer network. If 
desired, given communications may take place over a secure 25 
connection. Thus, for example, a client may communication 
with the server using a network security protocol, such as 
Netscape's Secure Socket Layer (SSL) protocol or the like. 

A representative client is a personal computer, notebook 
computer, Internet appliance or pervasive computing device 30 
(e.g., a PDA or palm computer) that is x86-, PowerPC®- or 
RISC-based. The client includes an operating system such as 
Microsoft Windows, Microsoft Windows CE or PalmOS. As 
noted above, the client includes a suite of Internet tools 
including a Web browser, such as Netscape Navigator or 35 
Microsoft Internet Explorer, that has a Java Virtual Machine 
(JVM) and support for application plug-ins or helper appli- 
cations. 

With reference to FIG. 1, a representative web server is an 
IBM Netfinity server comprising a RISC-based processor 40 
121, a UNIX-based operating system 123 and a web server 
program 125. Suitable I/O 106 devices and routines are also 
provided to facilitate the administration functionality 
described above. Of course, any convenient server platform 
(e.g., Apache, WebSphere, or the like) may be supported. 45 
The server may include an application programming inter- 
face (API) 129 that provides extensions to enable applica- 
tion developers to extend and/or customize the core func- 
tionality thereof through software program including plug- 
ins, CGI programs, servlets, and the like. 50 

The group administration mechanism of the present 
invention may be implemented in software executable in a 
processor, namely, as a set of instructions (program code) in 
a code module resident in the random access memory of the 
computer. Until required by the computer, the set of instruc- 55 
tions may be stored in another computer memory, for 
example, in a hard disk drive, or in a removable memory, or 
downloaded via the Internet or other computer network. 

In addition, although the various methods described are 
conveniently implemented in a general purpose computer 60 
selectively activated or reconfigured by software, one of 
ordinary skill in the art would also recognize that such 
methods may be carried out in hardware, in firmware, or in 
more specialized apparatus constructed to perform the 
required method steps. 65 

One of ordinary skill in the art also will appreciate that the 
caching technique of the present invention may be general- 
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ized for use with any software system having a plurality of 
object-based program elements. Thus, the invention is not 
limited for use in a WBI framework having transcoders. 

Having thus described our invention, what we claim as 
new and desire to secure by letters patent is set forth in the 
following claims. 

What is claimed is: 

1. A method of caching in a software system having a 
plurality of transcoders, comprising the steps of: 

associating into a group a set of related transcoders; and 
caching an aggregate output of the group only upon 

completion of the execution of all of the related 

transcoders. 

2. The method as described in claim 1 wherein the 
software system is a web server having a transcoding 
service. 

3. The method as described in claim 2 wherein a given 
transcoder is a request modifier. 

4. The method as described in claim 2 wherein a given 
transcoder is a content editor. 

5. The method as described in claim 2 wherein a given 
transcoder is a content generator. 

6. The method as described in claim 1 wherein each of the 
related transcoders in the group process images. 

7. The method as described in claim 1 wherein each of the 
related transcoders in the group process data. 

8. The method as described in claim 1 wherein each of the 
related transcoders in the group share a given preference 
category. 

9. The method as described in claim 1 wherein each of the 
related transcoders in the group share a given preference. 

10. A method of caching in a software system having a 
plurality of transcoders, comprising the steps of: 

associating sets of transcoders into one or more groups; 
responsive to a first client request, selectively caching an 

aggregate output of all of the transcoders in the given 

group; and 

responsive to a second client request, using the aggregate 
output to generate a response to the second client 
request. 

11. The method as described in claim 10 further including 
the step of: 

identifying a given priority for each of the one or more 
groups. 

12. The method as described in claim 11 further including 
the step of: 

executing the one or more groups according to the given 
priority. 

13. The method as described in claim 11 wherein at least 
one group includes a set of preference values. 

14. A method of caching in a software system having a 
plurality of program elements, comprising the steps of: 

associating sets of program elements into one or more 
groups; 

responsive to a first client request, selectively caching an 
aggregate output of all of the program elements in the 
given group; 

responsive to a second client request, using the aggregate 
output to generate a response to the second client 
request; and 

identifying a given priority for each of the one or more 
groups, 

wherein at least one group includes a set of preference 
values; and 

wherein the preference values are selected from a set of 
values consisting of user preferences, network prefer- 
ences and device preferences. 
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15. A method operative at a server, comprising the steps 

of: 

in response to a client request, generating a URL that 
identifies a set of transcoder groups, wherein each 
transcoder group comprises a set of administrator- 
defined transcoders; 

executing the transcoder groups to generate an output 
stream; and 

caching for subsequent reuse an output of a given inter- 
mediate transcoding step performed by one of the 
transcoder groups. 

16. The method as described in claim 15 wherein the 
transcoder groups are executed according to a given priority. 

17. The method as described in claim 16 wherein the 
given priority is user-defined. 

18. The method as described in claim 15 wherein each of 
the transcoder groups has a set of preference values asso- 
ciated therewith. 

19. The method as described in claim 18 wherein the set 
of preference values include a preference selected from a 
network preference, a user preference and a device prefer- 
ence. 

20. A computer program product in a computer readable 
medium for setting up intermediate caching points in a 
software system having a plurality of transcoders, compris- 
ing: 

means for associating sets of given transcoders into one or 

more groups; 
means for prioritizing the groups for execution; and 
means for defining a set of preference values for each 

group. 

21. A computer program product in a computer readable 
medium for caching data in a software system having a 
plurality of transcoders, comprising: 

means for associating a set of related transcoders into a 
group; 

means operative during the processing of a client request 
for caching for reuse and aggregate output of the set of 
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related transcoders, wherein an individual output of at 
least one of the set of related transcoders in the group 
is not cached. 

22. A computer system for providing a transcoding 
service, comprising: 

a set of transcoders; 
a cache; and 

a group administration mechanism for setting up inter- 
mediate caching points, comprising: 

means for associating a set of related transcoders into a 
group; and 

means operative during the processing of a client request 
for storing in the cache for reuse an aggregate output of 
the set of related transcoders, wherein an individual 
output of at least one of the set of related transcoders in 
the group is not cached. 

23. A computer system, comprising: 
a set of transcoders; 

a cache; and 

a group administration mechanism for setting up inter- 
mediate caching points, comprising: 
display means for enabling a system administrator to 
associate a set of client requests for storing in the 
cache for reuse an aggregate output of the set of 
related transcoders. 

24. A computer system, comprising: 
a set of transcoders; 

a cache; and 

a group administration mechanism for setting up inter- 
mediate caching points, comprising: 
means for dynamically associating a set of related 
transcoders storing in the cache for reuse an aggre- 
gate output of the set of related transcoders. 
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